Generate from image (Buffered)
POST /image_generate
The /image_generate endpoint is used to communicate with the LLM. Use this endpoint when you want to send an image to a multimodal LLM and receive the full text response all at once.
See the /image_generate_stream endpoint if you want to receive the response from the LLM as a stream, token by token.
This endpoint takes a multipart input, with two required fields:
- 'json_data': should contain JSON data, matching the format used for the /generate and /generate_stream endpoints.
- 'image_data': a stream of bytes, representing an image file.
Support for multipart requests is built into most common HTTP clients.
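As a rough illustration of the two-field multipart body described above, here is a minimal sketch using only the Python standard library. The helper names (`build_multipart`, `image_generate`), the `base_url` argument, the `image.png` filename, and the per-part content types are assumptions made for this example, not part of the documented API. Only the field names 'json_data' and 'image_data' and the /image_generate path come from this page.

```python
import json
import urllib.request
import uuid


def build_multipart(payload: dict, image_bytes: bytes, image_name: str = "image.png"):
    """Encode 'json_data' and 'image_data' as a multipart/form-data body.

    Returns (body_bytes, content_type_header_value).
    The per-part Content-Type values are assumptions for this sketch.
    """
    boundary = uuid.uuid4().hex
    delimiter = b"--" + boundary.encode("ascii")
    parts = [
        delimiter,
        b'Content-Disposition: form-data; name="json_data"',
        b"Content-Type: application/json",
        b"",
        json.dumps(payload).encode("utf-8"),
        delimiter,
        f'Content-Disposition: form-data; name="image_data"; filename="{image_name}"'.encode("utf-8"),
        b"Content-Type: application/octet-stream",
        b"",
        image_bytes,
        delimiter + b"--",  # closing delimiter
        b"",
    ]
    body = b"\r\n".join(parts)
    return body, f"multipart/form-data; boundary={boundary}"


def image_generate(base_url: str, payload: dict, image_bytes: bytes) -> dict:
    """POST the multipart body to <base_url>/image_generate and decode the JSON reply.

    base_url is hypothetical, e.g. wherever your server is listening.
    """
    body, content_type = build_multipart(payload, image_bytes)
    request = urllib.request.Request(
        base_url.rstrip("/") + "/image_generate",
        data=body,
        headers={"Content-Type": content_type},
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))
```

In practice a higher-level HTTP client will usually build this body for you; the manual encoding is shown only to make the two required fields explicit.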
To send a batch of requests with the same image, set the text field of the JSON payload to an array of strings instead of a single string. Only one image can be supplied per request. To run a set of generation requests, each against a different image, send them in quick succession and rely on automatic batching.
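The two batching patterns above could be sketched as follows. The `send` callable is a hypothetical stand-in for whatever function actually performs the multipart POST (it takes a JSON payload and image bytes and returns the result); `batch_payload`, `generate_for_images`, and `max_workers` are likewise names chosen for this example.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List, Sequence


def batch_payload(prompts: Sequence[str]) -> dict:
    """Same image, many prompts: 'text' becomes an array of strings."""
    return {"text": list(prompts)}


def generate_for_images(
    send: Callable[[dict, bytes], object],
    prompt: str,
    images: Sequence[bytes],
    max_workers: int = 8,
) -> List[object]:
    """Different images: issue one request per image, close together in time,
    so that server-side automatic batching has a chance to group them.

    Results are returned in the same order as `images`.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [pool.submit(send, {"text": prompt}, image) for image in images]
        return [future.result() for future in futures]
```

Whether concurrent requests are actually grouped depends on the server's batching configuration, so treat the thread pool here as one plausible way to send requests in quick succession rather than a guarantee of batching.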
Request
- multipart/form-data
Body
required
json_data
object
required
JSON generation payload, as used by /generate, /generate_stream, /image_generate, and /image_generate_stream.
text
object
required
Input text, accepted in a simplified form so that users do not have to construct the more verbose PayloadText. A mapping is provided to convert InputText to PayloadText.
oneOf: a string, or an array of strings
Responses
- 200
- 400
- 422
- 503
200: Takes in a JSON payload and returns the response all at once.
Content type: application/json
Schema
text
object
required
Input text, accepted in a simplified form so that users do not have to construct the more verbose PayloadText. A mapping is provided to convert InputText to PayloadText.
oneOf: a string, or an array of strings
Example (from schema)
{
  "text": "string"
}
400: Bad request
422: Malformed request body
503: The server is not ready to process requests yet.
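A client might map these status codes to outcomes roughly as follows. This is a sketch under assumptions: the function name `parse_image_generate_response`, the `ServerNotReady` exception, and the choice of which exception to raise for each code are illustrative and not defined by the API. Only the status codes, their descriptions, and the `text` field of the 200 body come from this page.

```python
import json


class ServerNotReady(Exception):
    """Raised for a 503: the server is not ready to process requests yet."""


def parse_image_generate_response(status: int, body: bytes):
    """Turn a status code and raw body into generated text or an exception.

    On 200 the body is JSON with a 'text' field, which may be a string
    or an array of strings depending on what was sent.
    """
    if status == 200:
        return json.loads(body.decode("utf-8"))["text"]
    if status == 503:
        # Callers may want to wait and retry in this case.
        raise ServerNotReady("server is not ready to process requests yet")
    if status == 400:
        raise ValueError("bad request")
    if status == 422:
        raise ValueError("malformed request body")
    raise RuntimeError(f"unexpected status code: {status}")
```

Separating 503 from the client-error codes makes it straightforward to retry only when the server is still starting up.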